85 research outputs found

La recerca lingüística en la TA

Author: Melero i Nogués Maite
Publication date: 01/01/2006

Linguistic research has a great deal to contribute to the development of machine translation (MT), and to the fundamental problem of translation divergences, in the form of observations of phenomena and of techniques and theories that MT research can adopt and combine with statistical methods for corpus analysis.

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

Revistes Catalanes amb Accés Obert

Diposit Digital de Documents de la UAB

DIALNET

Holaaa!! Writin like u talk is kewl but kinda hard 4 NLP

Author: Domingo Judit
Marquina Montse
Melero Maite
Quixal Martí
Ruiz Costa-Jussà Marta
Publication date: 01/01/2012

We present work in progress aimed at building tools for the normalization of User-Generated Content (UGC). As we will see, the task requires revisiting the initial steps of NLP processing, since UGC (micro-blogs, blogs and, more generally, Web 2.0 user texts) presents a number of non-standard communicative and linguistic characteristics, and is in fact much closer to oral and colloquial language than to edited text. We present and characterize a corpus of UGC text in Spanish from three different sources: Twitter, consumer reviews and blogs. We motivate the need for UGC text normalization by analyzing the problems found when processing this type of text through a conventional language processing pipeline, particularly in the tasks of lemmatization and morphosyntactic tagging. Finally, we propose a strategy for automatically normalizing UGC using a selector of correct forms on top of a pre-existing spell-checker. Postprint (published version)

UPCommons. Portal del coneixement obert de la UPC

El futur de les llengües en l’era digital: Oportunitats i bretxa lingüística

Author: Maite Melero Nogués
Publication venue: Escola d'Administració Pública de Catalunya
Publication date: 01/12/2018

In this article we reflect on how the digital revolution will affect the survival of languages in the not-too-distant future. If one thing is clear, it is that human language will be the predominant means of communication between people and technology, and between people and the world's collective knowledge and information. Indeed, the use of one language or another determines the amount of information that can be accessed, as well as the services available. The key is the technological baggage with which the different languages face the digital challenge. The richness of each language's technological resources will crucially affect its chances of reaching the 22nd century in good health. The languages at most immediate risk are, evidently, those affected by "digital diglossia": bilingual speakers of a regional language and of a language of globalization, rather than miss the digital train, opt for the larger language and set aside the one that does not take part in technological progress. The effects this may have on linguistic diversity in the digital ecosystem, and by extension in the world, are devastating.

Directory of Open Access Journals

Results from the ML4HMT-12 shared task on applying machine learning techniques to optimise the division of labour in hybrid machine translation

Author: Badia Toni
Costa-Jussá Marta
Federmann Christian
Melero Maite
Okita Tsuyoshi
van Genabith Josef
Publication date: 09/12/2012

We describe the second edition of the ML4HMT shared task, which challenges participants to create hybrid translations from the translation output of several individual MT systems. We provide an overview of the shared task and the data made available to participants before briefly describing the individual systems. We report on the results using automatic evaluation metrics and conclude with a summary of ML4HMT-12 and an outlook on future work.

Irish Universities

DCU Online Research Access Service

English-Catalan Neural Machine Translation in the Biomedical Domain through the cascade approach

Author: Casas Noe
Costa-jussà Marta R.
Melero Maite
Publication date: 01/01/2018

This paper describes the methodology followed to build a neural machine translation system in the biomedical domain for the English-Catalan language pair. This can be considered a low-resource task from the point of view of both the domain and the language pair. To address it, the paper reports experiments on a cascade pivot strategy through Spanish for neural machine translation, using the English-Spanish SCIELO and Spanish-Catalan El Periódico databases. To test the final performance of the system, we have created a new test data set for English-Catalan in the biomedical domain, which is freely available on request. Comment: Full workshop proceedings can be found at https://multilingualbio.bsc.es/wp-content/uploads/2018/03/LREC-2018-PROCEEDINGS-MultilingualBIO.pd

arXiv.org e-Print Archive

UPCommons. Portal del coneixement obert de la UPC

Cas d'integració de la TA : Microsoft

Author: Melero i Nogués Maite
Publication date: 01/01/2006

This article presents MSR-MT, a hybrid MT system developed by the Natural Language Processing group at Microsoft Research. MSR-MT will make it possible to automatically translate into several languages all the as-yet untranslated articles in the knowledge base developed by Microsoft's Product Support Services (PSS).

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

Revistes Catalanes amb Accés Obert

Diposit Digital de Documents de la UAB

DIALNET

TRADE (MLAP93/003)

Author: Bel Rafecas Núria
Melero Nogués Maite
Publication venue: Sociedad Española para el Procesamiento del Lenguaje Natural (SEPLN)
Publication date: 21/02/2019

Machine translation (MT) is considered one of the most important applications of language engineering from a commercial point of view. Although the MT problem is far from solved, there is a clear need for operational products that cover, at least partially, market demand in this area by providing translation aids and tools.

Diposit Digital de la Universitat de Barcelona

Transfer Learning with Shallow Decoders: BSC at WMT2021’s Multilingual Low-Resource Translation for Indo-European Languages Shared Task

Author: Armengol Estapé Jordi
Gibert Bonet Ona de
Kharitonova Ksenia
Melero Maite
Rodríguez i Alvarez Mar
Publication venue: Association for Computational Linguistics (ACL)
Publication date: 01/01/2021

This paper describes the participation of the BSC team in the WMT2021's Multilingual Low-Resource Translation for Indo-European Languages Shared Task. The system aims to solve the Subtask 2: Wikipedia cultural heritage articles, which involves translation in four Romance languages: Catalan, Italian, Occitan and Romanian. The submitted system is a multilingual semi-supervised machine translation model. It is based on a pre-trained language model, namely XLM-RoBERTa, that is later fine-tuned with parallel data obtained mostly from OPUS. Unlike other works, we only use XLM to initialize the encoder and randomly initialize a shallow decoder. The reported results are robust and perform well for all tested languages. Postprint (author's final draft)

UPCommons. Portal del coneixement obert de la UPC

Soluble ST2 levels and left ventricular structure and function in patients with metabolic syndrome

Author: Beunza Maite
Cardelli Patrizia
Celic Vera
Di Somma Salvatore
Escribano Elena
Floridi Federico
Lopez-Andres Natalia
Magrini Laura
Majstorovic Anka
Marino Rossella
Melero Amaia
Pencic-Popovic Biljana
Roy Ignacio
Salerno Gerardo
Sljivic Aleksandra
Publication venue: Korean Society for Laboratory Medicine (KAMJE)
Publication date: 01/01/2016

Background: A biomarker that is of great interest in relation to adverse cardiovascular events is soluble ST2 (sST2), a member of the interleukin family. Considering that metabolic syndrome (MetS) is accompanied by a proinflammatory state, we aimed to assess the relationship between sST2 and left ventricular (LV) structure and function in patients with MetS. Methods: A multicentric, cross-sectional study was conducted on 180 MetS subjects with normal LV ejection fraction as determined by echocardiography. LV hypertrophy (LVH) was defined as an LV mass index greater than the gender-specific upper limit of normal as determined by echocardiography. LV diastolic dysfunction (DD) was assessed by pulse-wave and tissue Doppler imaging. sST2 was measured by using a quantitative monoclonal ELISA assay. Results: LV mass index (β=0.337, P<0.001, linear regression) was independently associated with sST2 concentrations. Increased sST2 was associated with an increased likelihood of LVH [Exp (B)=2.20, P=0.048, logistic regression] and increased systolic blood pressure [Exp (B)=1.02, P=0.05, logistic regression]. Comparing mean sST2 concentrations (adjusted for age, body mass index, gender) between different LV remodeling patterns, we found the greatest sST2 level in the group with concentric hypertrophy. There were no differences in sST2 concentration between groups with and without LV DD. Conclusions: Increased sST2 concentration in patients with MetS was associated with a greater likelihood of exhibiting LVH. Our results suggest that inflammation could be one of the principal triggering mechanisms for LV remodeling in MetS.

PubMed Central

Archivio della ricerca- Università di Roma La Sapienza

The strategic impact of META-NET on the regional, national and international level

Author: Ananiadou Sophia
Branco Antonio
Hajic Jan
Hernáez Inma
Mariani Joseph
McNaught John
Melero Maite
Monachini Monica
Moreno Bilbao M. Asunción
Odijk Jan
Piperidis Stelios
Rosner Mike
Skadina Inguna
Tadic Marko
Thompson Paul
Tufis Dan
Publication venue: Springer Science and Business Media LLC
Publication date: 01/01/2016

This article provides an overview of the dissemination work carried out in META-NET from 2010 until 2015; we describe its impact on the regional, national and international level, mainly with regard to politics and the funding situation for LT topics. The article documents the initiative's work throughout Europe in order to boost progress and innovation in our field. Peer Reviewed. Postprint (author's final draft)

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

UPCommons. Portal del coneixement obert de la UPC